Music has always been a very important part of our lives, and we can easily fall in love with songs without knowing the lyrics. However, a well-written lyric should always be adored as it is a primary tool for the listeners to connect with the songwriter. Being able to understand the words of the lyrics, we can access what is in their mind. It is so intuitive, and we do not need to be trained to read it. Then, what can lyrics tell us? In this context, we will analyze a large set of lyrics and discover some potential information through data visualization. In this report, we will narrow down to two main characterstics: the genre and the decades when the songs were published.
Before we start, let us take a brief look at what is provided. We will mainly study the genre of the songs and the period of the time it published.
Notably, a large proportion of the songs came up in the 2000s and most of the songs are rock music. Why is this happening? How about the proportion of the genres within each decade? A mosaic plot will provide a more detailed description. It plots the genres of the songs against the decades when they were published. According to the plot, Rock music accounts for a large proportion of the total music genre across all decades, and fewer R&B songs were published proportionally after the year 2000. In the 1980s, lots of hip-hop music came up, marked a hip-hop era. Later, in the 2010s, pop music arose, and the share of rock music keeps going down throughout these years.
Lyrics are combinations of words. The lyricists choose the words very carefully, as they place their hopes and thoughts in these sentences. They are perfect mirrors for their spirit. Then, what are the keywords across all genres of the songs and did the keywords change though out the year? I count the frequency of words appeared in the lyrics with respect to our interests, and consider the most frequent word is the main topic. From the previous discussion, we noticed that rock music is a major part of the song development, and let us take a look at rock music by generating a wordcloud.
We can tell that love is a major topic in rock music, and rockers also talk about time and ill, with a lot of use of the pronoun you. Rock music deals with a wide range of topics and themes. From our plot, pop music arose in the 2010s, and we can take a look at how they differ from rock music in terms of the word use.
The topic of lyrics did not change dramatically, but pop music uses baby and heart more often compared with rock music, as pop is softer and has a simpler structure.
Music is like an infant, and it develops through the years. The topic has been changing in different historic times. The 2000s is a decade for the song wave, and plentiful songs came up. What are the keywords for two-thousands? How about now?
> 2000s
> 2010s
We can tell that the keywords are very consistent in the 2000s and 2010s. People talked about love, time, baby, and ill. Love is always a big word for all time.
Lyric is a way for the lyricists to transfer their emotions. The ability of recognizing the basic emotions, like happiness and sadness is a common skill that we were born with. First, we are going to analyze the sentiments in the lyrics with the NRC word-emotion lexicon. It is a collection of words associated with eight emotions (anger, fear, anticipation, trust, surprise, sadness, joy, and disgust) and two sentiments (positive and negative) (“NRC Word-Emotion Association Lexicon”, n.d.)
This plot shows that the emotion word counts across from all genres. The darker the bar is, the more word count this genre has. Here, all genres of songs emphasize on two sentiments. The rock music talks less about joy and disgust, as well as pop music; but hip-hop favors trust, fear, and anger. Meanwhile, metal shows a strong favor of negative sentiment and fear emotion.
How about the sentiment counts for each different decades? There are very dark bars in the 2000s, representing positive and negative sentiments, and indeed, these two sentiments are the major emotions for all time. We may notice that for different decades, the distributions of the word counts are quite similar: people prefer trust, sadness, joy, fear, anticipation, as well as anger.
As the two sentiments appear the most, let us concentrate on these two sentiments: positive and negative. These two plots present the value of positive counts minus negative counts, providing an overall sense of how the big emotion in the lyrics.
The first plot is grouped by the time of publishing. The sentiments were very flat before the 1990s, and after the 1990s, people will use strong emotion words to express their thoughts.
The second plot is grouped by the genres of the songs. Note that hip-hop and metal songs have strong favor for negative sentiment, while jazz shows favor for positive sentiment. Also, electronic, folk, and rock music have cases where the lyricists expressed extreme emotions.
After analyzing two sentiments, we will focus on the sentiment scores based on our interest. The sentiment score represents the overall sentiment of the lyrics, and it should be consistent with our analysis above.
sentiment score plot for decades
The first plot represents the sentiments score for the time of publishing, and we can notice that the points spread out more in the 2000s and 2010s, which means that the songwriters used more emotional words in this period of time.
sentiment score plot for genres
The second plot represents the sentiments score for the genres. The average sentiment scores (red dot) tend to be negative for hip-hop and metal and positive for jazz, which, again, match our analysis above.